Title cf the Invention 
ARITHMETIC UNIT 

5 Field of th.€ Invention 

The present invention relates to an arithmetic unit and in 
particular bu- net exclusively to an arithmetic iinir for use in a 
digital signal processor. 

10 

Background cf the Invention 

In known ari'zhT^-ic units, it is possible to calculate a multiply 
accumulate tjs.zz'^on. In a multiply accumulate function, a first 
15 number X is rultiplied by a second nurrJoer Y and added to an 
accumulator ACCU. In mathematical notation^ the result RES of 
this function can be represented as follows: RES=ACCU+X^-Y . 

Reference ii: T.ace to Figure 1 which shows a block diagre.m of an 
20 arithm.etic unir 2 which is able to add together two partial 
products PI and ?2 and the accumulator. The arithmetic unit 2 
has a first, register 4 which stores the value ?1 and a second 
register 6 which stores zhe value P2 . When X is multiplied by Y, 
two partial products PI and P2 result, with Pi + P2 = X * Y- 
25 These values ?1 and P2 are stored in the first and second 
registers 4 and 6 respecuively. 

The current accumulator ACCJ is stored in a third register 3, In 
order to ohiair. the result RES, two further steps are carried 

30 out. The three vectors ?1, P2 and the accumulator ACCU are 
reduced to t:wo vectors Tzhrough a carry save adder stage 10- The 
carry save adder stage 10 actually comprises a number of 
individual 3 to 2 carry save adders each of which provides two 
outputs. One cf chose outputs is a sum output and the other 

35 output is a carry output. The numJoer of carry save adders making 
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uo the carr;,* save adder stage will at least be equal to the 
number of hits of the register Fl. the register P2 or the 
accumulator ACCU. 

5 As discussed hereinbefore, each individual carry save adder 
provides cwo outputs, a carry output and a sum cu'cput . As uhere 
are several carry save adders in parallel, a sum vector and a 
carry vector are obtained. These vectors are input to a global 
adder 12. The global adder 12 provides the result R3S . 

10 

'AhQTL the first value ?1, the second value P2 and the third value 
ACCU are added together, the result RES rray fall outside a 
permitted rc? nge of values. 

15 Summary of the Invention 

It is therefore an aim of embodiments of the present invention to 
ensure that the final result is within a permitted range of 
values. 

20 

According to one aspect of the present invention, there is 
provided an arithmetic unit for adding a plurality of values to 
define a rs^sult, said arithmetic unit comprising means for 
receiving £;aid plurality of values; means for adding said 
25 plurality of values to define a result, said result being within 
a first range; means for determining if said result fall within a 
second rang.^, said second range being smaller than the first 
range; and means for modifying said result in so that the result 
output by s:iid arithmetic -unit falls within the second range, 

30 

In this way, it can be ensured that the output of the arithmetic 
unit always fails within the second range. When the result falls 
outside the second range, the result is sometimes referred to as 
having overflowed. When it is modified to fall inside the second 
35 range, it ii5 referred to as being clamped or saturated. 
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In one eTi^odiment of the present inventicn, three values may be 
added together. Two of those values may be a partial product 
whilst the third of those values x^y be an accumulation result. 

5 

Accordingly, the arithmetic unit embodying the present invention 
can be used £or multiply accumulate functions. 

The adding means may comprise a carry save adder. The carry save 
10 adder may co.-nprise a plurality of 3 to 2 carry save adders. The 
adding mean.;; may comprise an adder stage for providing said 
result. The adder stage may add the two outputs of the caz-r^^' save 
adder stage 'to provide a single resui";:. 

15 The determining means may be arranged to take into account the 
bits other :.han the considered bits. Only some of the bits of 
each of the jplurality of values rr^y be used perform a predecoding 
as to whether or not the result falls outside the second range. A 
bit generateid by the addition of all zhe other bits of the 

20 plurality oi values may also be used. This bit may enaole the 
achievement ' of decoding as to whether or not the result falls 
outside the second range. This may take the form of a carry. 

The second ::'ange can be expressed by N bins and bits N, N-1 and 
25 possibly th^; ISr-2 of at least som.e of the plurality of values may 
be considered. The number of bits which are considered may depend 
on the numliar of values, the range of each value and/or the 
required range of the final result. 

30 The arithmetic unit may be arranged to operate in 2's complement 
binary arithmetic. This means tha- positive and negative numbers 
can be reprt:;sentsd . 



A first of the plurality of values may have an. N bit forroar. and 
may fall in rhe rar.ge -2^"^ to 2^"^-l, The sum of a se::oiid and a 
third of the plurality of values may fall in the range of -2^*'- to 
2^'^, The first range may be -2^' to 2^-1 and the second range T.ay 
be -2^'" to 2"'^-l. h round tray be introduced when generating first 
and second partial product values. This round may conssist of 
adding 2^^^'^ tc the result so that the first range may be -2^' ::o 
2^-1 ^ 2^«^^^--. 

The determining means may consider bits down to N-2 where rhree 
values are used. A fii^rst carry save addition of bits N-2, a 
second carry save addition of bics N-l of all of said plurality 
of values a::d a third carry save addition of bits N froir -wo of 
said plural.-ity of values and bit N-l from a further one of said 
plurality o;; values may be considered by nhe determininc rr.eans- 
The further one of the plurality of values may be an accumulator 
value whilst the other tv;o values may be partial product values. 
Bits of significance greater than or equal to N-l only, resulring 
form the th]:ee carry save additions may be taken into acc:ount by 
the determining means and may be added together by a fourth 
addition, A fifth addition may be performed between the resulting 
bits and a carry generated by the additions of bits N-2 down to 0 
of the three values. If the result of the fifth addition has 
first and second bit:s which are different, then said result falls 
outside said second range. If the first and second bits are rhe 
same, then t:he result falls within the second range. 

The calculation generating the carry used in the fifth addition 
may be perfcrr^ed in the same v;ay as described previously. First a 
carry save addition m.ay be performed on the three values. Then 
the resulting two values may be added together in a final 
addition, T:ie carry used by zhe determining means may be the one 
out column lJ-2 of the final addition, 



The means for m.odifying the result may be arranged to receive 
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information 'as to the sign of the total of two of said plurality 
of values and information as to one bit of a further one of said 
values to di;:terTnine if che result can fall out of said second 
2:ange at th<: positive end thereof or the negative end thereof. 
5 The modifying means nay modify the result zo have a maximum 
positive or T.aximrum nega-ive value of said second range depending 
on which end of the second range the result can fall out of and 
if the result is determined to fall outside said second range. 

10 A plurality- of registers may be provided for storing said 
plurality o.l values. 

Brief Description of the Drawings 

15 For a bettei* understanding of the present invention and ais to how 
the same ma-)- be carried into effect, reference v;ill now be made 
by way of e.>:ample to the accompanying drawings m which :- 

Figure 1 shows a knov;r, arithmetic unit; 
20 Figure 2 shows a block diagram of an arithmetic unit enibodying 
the present invention; and 

Figure 3 il.iustrates the calculation performed by the arithmetic 
unit. 

25 Degcription of the Preferred Smbodiinents 

The embodiment shown in Figure 2 is arranged to carry out a 
multiply aorrumulate function where RES=ACCU-rX*Y. As discussed in 
relation to the figure 1, X is the number to be multiplied and Y 
30 is the multiplier. ACCU is the accumulator value which may be 
increasing or decreasing and RES is the result cf rhe f^inction. 

X and Y are initially multiplied to generate tvc partial products 
PI and P2 . The generation of two partial products from two 
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numbers multi.plied together is well knc-wn ir. the art and v^ill not 
be described further here. The result of the multiplication P, is 
defined as being equal co P1+F2- 

5 As with the arithmetic unit shown m Figure 1, a first register 
14 is provided for storing the firsr partial product Pi- and a 
second regi;£.ter 16 is- provided for storing the second partial 
product P2. 2's complement arithme'ic is used. The first bit of a 
signed nurrJoer indicates if zhe number is positive or negative. 

10 The usual convention is that if the first bic has the value "1" 
then the number is a negative number whereas if the first bit has 
the value then the number is a positive number. 

I 

I A third register 18 is provided for nhe current accumulator ACCU. 

^ 15 The third r=;igister 18 stores a N-bit value. The value of the N 
J bit data w:.ll fall in the range of -2^'' to 2^'^-l, ass it is 

] expressed i:i 2's complement: arithmetic. 

I 

The product P resulting from the sur. of ?l-fP2 will be in the 

'I 

I 20 range -2^*"^ i:o 2^'-. PI may be a N+2 biz negative value and P2 may 
f be a N-f-3 bic positive value. However ?1 and P2 can have other 

I formats. PI, and P2 are such that cheir sum always falls within 
^ . the range -2^"^ to 2^"^. 

25 When the N i.east significant bits of the accumulator are added to 
the product P, the final result will be in the range -2^' to 2^-1. 

A round m.ay also be performed on the final result. In other 
words, the N/2 least significant bits can be cleared. In one 
30 emijodiment of the present invention, a round is performed by 
adding 2^^^^^*^^ to the partial products and then clearing the N/2 
least significant bits from the result RES. Clearing the N/2 
least significant bits can be equated to setting those bits to 
" 0 " , 



35 



7 

The final re>:iult, after rounding, will therefore be in th(- range 
of -2^' to 2^+2 ^''''^'^'-1. However, saturation has to be performed in 
order to kee-p the result within the range -2^'"^ to 2^'^-l. The 
circuitry fo:: ensuring that the result is saturated or is within 
5 the required range will be described in rv,ore detail hereiaafter. 

The output o:: the first second and third registers 14, 16 and 15 
are input tc a carry save adder stage 20. As with the arithmecic 
unit of Figure 1, che carry save adder srage comprises a nuinber 

10 of 3 to 2 carry save adders. The number of carry save adders may 
be eq^jal to N+1, K carry save adders are required if N bits of 
the accumulator ACCU are taken into acco-unt. One further carry 
save adder ils required if overflow detection is considered. Each 
carry save adder receives one input from each of the three 

15 registers l'^ , 16 and IS and provides a carry output and a sum 
ouTiput. Each of these carry and sum outputs are input to a global 
adder stage 22, The global adder stage 22 comprises a member of 
adders each' of v;hich adds a carry ouuput and a sum output to 
provide a respective one of the bits of the result. 

20 

The principal behind overflow decoding will now be discussed- The 
maximum value is 2''+2 -1 , before saturation, in 2's 
complement .-.rithmetic , this value is expressed by N-t-2 bits. The 
minimum before saturation is -2^ which can be expressed in N+l 
25 bits. 

One way of identifying that saturation is required will be 
described w:.th reference to Fig^ure 3 . Numbers in the first range 
-2^ to 2^+2 ^^'"^"^^-1 are expressed on N+2 bits, that is bit 0 to bit 
30 N+1. To write down the most positive values in the range requires 
N+2 bits. Numbers in the second range -2^''^ to 2^''^-l are e:<pressed 
on N bits, i:hat is bit 0 tc bit N-1. 
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To check if there is an overflow the following bits are checked: 
bits g::eater than bit N-1 are eq^ual to bit N-1 if the number 



s 

is in the sjecond range. The bits N-1 and greater shculd be 
checked; and 

bits over bit N+l, thaz is the bit as bits are caur.red 

from 0, wii:, always be rhe same if the number is in the first 
5 range , 

It is thereJiore not necessary to check bits over bit N+I. Bits 
N+l, N and £1-1 should be checked. However, embodiments of T:he 
present invention are able to suppress the check on bit N+1. A 

10 very few numoer of values actually require the N+2 bit fcrmat in 
the first rc'nge. These values are the ones in the range 2^ to 2'""'- 
l-r2^^^"\ The:je values have the property that bits N and N-1 

have the values 0,1 and 0. As bits N and N-l are different for 
these values, it is sufficient to check these two bits to declare 

15 these values! are outside the second range. Thus, bit K-^1 doesn't 
need to be checked for i:he values in the range 2^ to 2^- L-h2^'^^'^ . 

For the otfeir values of the first range, that is values in the 
range -2^ to 2^'-l, these are expressed on N+1 bits onlV/ that is 
20 bits 0 to N, It is therefore sufficient to check bits N-1 and N, 

Reference iii made to Figure 3 which shows how these principles 
can be applied in embodiments of the present invention. The 
partial proclucts PI and P2 are used as is the accumulator ACCU. 
25 The accumulator ACCU has a N bit format, that is it can be 
expressed on N bits, that is 0 to Bit N is a copy of bit N-1 

obtained by carrying out a sign extension. In Figure 3, only bins 
N, N"l and ::l-2 are shown. 

30 A carry save adder stage IGO enables PI, P2 and ACCU to be 
reduced to zi sum vector and a carry vector. 

The values of interest are Carry (N-1), Sum(N-l) , Carry (N) and 
Sum (N) . Carry (N-1) is generated by the carry save adder 100 
35 receiving bits N-2 from the two partial products PI and P2 and 
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bit N-2 from the accumula-cor ACCU. 

Sum(N-l) an:i carry (N) are generated by the carry savei adder 
receiving bits N-1 fron the partial products PI and F2 an,d ACCJ. 

5 

Sum (N) is *:|enerated by the carry save adder 100 receiving rhe 
bit N from :.he two parzial products PI and P2 and biu N-1 froTTL 
ACCU. 

10 Carry (N) , Carry (N-1), Sum (N) and Sum (N-1) are added t.ocerher 
to generate SK-1 and SN. The carry from the previous column N-2 
is taken into account and is added to S(N-l) . This carry as:-2 is 
the carry generated out of column N-2 when performing the 
addition of Sux (N-2 down to 0) and Carry (N-2 down to 0) . 

15 

V?hen per for 'ling the addition 
SN StJ-1 

3 ca^2 

the bits N and N-1 of the result of the multiply accumula-e 
20 function arc^ obtained. These bits are compared. If they are noc 
equal, there is an overflow as explained above. To speed up 
detection, :;n and SN-1 are first calculated, without taking in-o 
account CN-Ii. According to the values SN and SN-1, it is defined 
which values of CN-2 give an overflow, 

25 

The carry s.rve adder stage 20 will add bits N-1 from the first to 
third registers 14 to 18 to provide a sum outpun Sum (N-1) and a 
carry output Carry (N) as illustrated in Figure 3 by arrows A and 
B respectively. These values are output to the global adder 2 2 
30 and also to a calculation unic 24 via inputs 32 and 34. . 3ics N- 
2 from the first, second and third registers 14 to 18 ars sumrr^ed 
to provide the carry output Carry (N-1) .The bit Carry t'N-1) is 
transmitted to the detection unit 24 via line 32. 

35 The calculation -onit 24 also receives the following inputs: br- N 
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frorr. the first register 14 via line 25; bit N frorn the second 
register IS via line 28 and bit N-l of the third register 18 via 
line 30. Bi'; N-l of the third register is taken into account 
instead of b:.t N. This is because this register is the 
accuxulator and cnly N bits of the accurriulator are taken into 
accour.i:. Bit that is the N+1 bit is a sign extension and so is 
a copy of b:.t N-l. Using these values, che calcularion unit is 
able to cal::ulate the values 5N and SN-1. SN-1 is the sum of 
Sum(N-l) frcTi the carry save adder stage and the Carry (N-1) from 
the carry siave adder stage. Value SN" is the SUT. of Sum(N) ; 
Carry (N) and the carry Resulting from the suTiuuing of Sum (N-1) and 
Carry(N-l) . ^ sum (N) is calculated in uhe calcularion unit 24 by 
adding bit :x from the first and second registers and bit N-1 of 
the third rejgister. Carry (N) is from the carry save adder stage 
20. 



Reference is. made to uhe table 1 which summarises the cases where 
an overflow will occur. 
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Overflow if Cm. 2= 
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0 


1 


Always overflow 


1 


0 


Overflow if 0^-2- 




1 


I 


No overflow 



The calculation unit 24 provides two outputs. The first output 36 
SatifO will be high to indicate that saturation is required if 
the carry c:vJ-2 is zero whereas the second output 3 9 Satifl will 
be high to :.ndicate that saturation will occur if the carry CN-2 
is 1. If s:7 and SN-1 are both 0, then a high output will be 
provided via the second output Satifl. If SN is 0 and SN-1 is 1, 
then a high output will be provided on both the SatifO output 3G 



II 

and zhB Satifl output 3 6 as an overflow will always be provided 
regardless o:: the value of the carry CN-2. If SN has the value 1 
and SN-l is equal to o, then a high output will be provided on 
the first output SatifO 3 6 as an overflow will be provided if the 
5 carry CN-2 is zero. Finally, if s:c and 3N-1 are both 1, then a 
zero outpuu :.s provided via the firs- and second output SatifO or 
Sazifl as nc; overflow -will occur. 

The first a::id second outputs 35 and 33 are input to a first 
10 multiplexer which is controlled by the carry 0^-2 which isj output 
froTT; the glcbal adder If there is saturation, zhe output of 
the first multiplexer 40 prov-des a high signal to a second 
multiplexer .42 from which ever of zhe first and second outputs 36 
and 38 of the calculation units 24 is high. In particular; if the 
15 carry value CN-2 is 1, then the second output 38 Satifl will be 
output by the first multiplexer 40, If the carry value aC-2 is 0, 
then the fiirst output 3 6 SatifO will be output by the first 
multiplexer 40. If there is nc saturation, then thii first 
multiplexer 40 provides a zero ou-puc to the second multiplexer 
20 42. 

In summary, satif l=not (SN) and satifO=3N XOR SN-1 where XOR is an 
exclusive o;: function. 

25 If sat = 0, then the multiplexer 42 will output the Res value. If 
sat=l, then there is an overflow and the value out of the 
multiplexer 42 is the saturaticn value generated by uni'i 44. 

In an alternative embodimer." of the present invention, 
30 not (satif 0) and not (satifl) are generated instead of satif 0 and 
satifl respectively. This provides a saturation signal which is 
the opposite to that of the arrangeT.ent described previously. 
This signal has the value 1 if there is no overflow and 0 if 
there is an overflow. The correct ou-put of the multiplexer 42 is 
35 obtained by permutation the inputs of the multiplexer. The result 
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value is cor:\sccsd on input '1' of the multiplexer 42 and Sat^val 
is connectec to inpu:: '0' of multiplexer 42. This way, w:n.en the 
signal has tr.e value 0 anc there is an overflow, the outpu- of 
the multiplexer 42 is Sat_val . When the signal has the value 1 
5 and there is. no overflov;, then the output of the multiplexer is 
the Res valus , 

Once detec::icn of an cverflcv or saturation situation has 
occurred, i:: is necessary to define the saturation value which 

10 should repl5.ce the ac::ual calculated value. Overflow occurs when 
the result riSS falls outside the permitted range. In the case of 
a negative ^'alue, the saturation value will be -2^'^. In the case 
of a positive value, the saturation value will be 2^"^-i- This is 
determined by the saturation unit 44, The saturation uniz 44 

15 receives th::-es inputs 46 to 50. The first input 46 provides bit 
N-1 of the chird register 28. The second input 4S provides the 
sign of th?2 surr. of Pl-rr2. The chird input 50 prov:,des an 
indication a.s to whether or not a round takes place. i;chen a round 
is perf orme-:'., the N/2 least significant bits are cleared, 

20 

Table 2 surrarises whether or not a positive or negative overflow 
takes place. 



ACCU and ? sign 


No round performed 


Round perfomied 
(adding 2^^^^**") 


ACCU>0,P>a 


Positive OVF 


Positive OVF 


ACCU<0, P<C 


Negative OV? 


Negative OVF 


ACCU>0,P<G 


No possible OVF 


Positive OV? 


ACCU<0, P>0 


No possible OYF 


Positive OV? 



25 Taking into account the range of the accumulator value and of the 
product P, ■:he sign of the two data and if a round is perforrred. 



the cype of overflow condition {positive overflow^ negative 
overflow or no possible overflow) can be determ-nsd. If "he 
accumulation value ACCU and the product P are greater than zero, 
a positive overflow is possible. If the accumulation value ACCU 
5 ana the product P are both less than zero, a negative overflow is 
possible. If the accumulation value ACCU and the produce P have 
the opposite sign, then overflow is not possible without 
rounding. If rounding does cake place then positive overflow rray 
occur. The iiaturation unit 44 provides the saturation value 52 
10 according tc; the inputs 46, 4S and 50- When a negative overflow 
is possible, the value, 52 is -2^"-^. When a positive overflow is 
possible and, there no rounding, the value 52 is 2^'" -1. Wien a 
positive ove'trflow is possible and there is rounding, the value 52 
is 2^-^-2^^\ 

15 

The output :'2 of che saturatiicn unit 44 is input zo the second 
multiplexer 42 along with the output from the global adder stage 
22. If an overflow has occurred as deterrrlned by che calculation 
unit 24, the saturation value will replace the result RES output 

20 by the globr:-.l adder stage 22. If, on the other hand, saturation 
IS not required, then the result RES of the global adder stage 22 
will be output by che second multiplexer 42. The output of the 
first multiplexer 40 will control the output of the second 
multiplexer 42 to select either the output of the saturation unit 

25 44 or the output of the global adder stage 22 depending on 
whether or rot the result determined by the global adder ;3L:age 22 
fall within or outside the permitted range. 

When an overflow occurs, the result RES is replaced by the 
30 saturation ^alue. If a negative overflow occurs, that is the 
result RES is below the minimum value -2^"*, then the saturation 
value is thi:. minimum value that can be expressed in N bits in 2's 
complement: arithmetic, that is -2*^"^, If a positive overflow 
occurs, that is the result RES is greater than the maximum value 
35 2^"^-l,the s*:ituration value is the ir^ximum positive value 2^"^-l if 
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no round has to be performed or if a round has to be per::ormed, 
the maximum positive value wich the N/2 least significant bits 
cleared/ that: is 2^''--2^^^' Thus it is possible to perform a.11 the 
overflow detection calculation in parallel with the calculation 
5 of the resul': R3S (and the carry CN-2) . Then the final detection 
of the overflow can be obtained as soon as CN-2 is available. It 
is done thin way, because CN-2 gives the ability to talce into 
account bits N-2 down to 0 of the operands, but it is the latest 
available signal required for cverflow detection. 

10 

In alternative embodiments of the present invention, it is 
possible to perform the same saturation detection using other 
signals from the registers 14 to lo, zhe carry save adder stage, 
or the global adder. 

15 

It should be appreciated thar embodiments of the present 
invention can be incorporated in a digital signal processor. 
Enibodiments of the present invention car. be incorporated in an 
integrated circuit . 
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